Privee: An Architecture for Automatically Analyzing Web Privacy Policies
نویسندگان
چکیده
Privacy policies on websites are based on the noticeand-choice principle. They notify Web users of their privacy choices. However, many users do not read privacy policies or have difficulties understanding them. In order to increase privacy transparency we propose Privee—a software architecture for analyzing essential policy terms based on crowdsourcing and automatic classification techniques. We implement Privee in a proof of concept browser extension that retrieves policy analysis results from an online privacy policy repository or, if no such results are available, performs automatic classifications. While our classifiers achieve an overall F-1 score of 90%, our experimental results suggest that classifier performance is inherently limited as it correlates to the same variable to which human interpretations correlate—the ambiguity of natural language. This finding might be interpreted to call the notice-and-choice principle into question altogether. However, as our results further suggest that policy ambiguity decreases over time, we believe that the principle is workable. Consequently, we see Privee as a promising avenue for facilitating the notice-and-choice principle by accurately notifyingWeb users of privacy practices and increasing privacy transparency on the Web.
منابع مشابه
A Taxonomy for Web Site Privacy Requirements
Privacy has recently become a prominent issue in the context of electronic commerce Web sites. Increasingly, privacy policies posted on such Web sites are receiving considerable attention from the government and consumers. In this paper we present a taxonomy for Web site privacy requirements. We have used goal-mining, the extraction of pre-requirements goals from post-requirements text artifact...
متن کاملSearching for Privacy: Design and Implementation of a P3P-Enabled Search Engine
Although the number of online privacy policies is increasing, it remains difficult for Internet users to understand them, let alone to compare policies across sites or identify sites with the best privacy practices. The World Wide Web Consortium (W3C) developed the Platform for Privacy Preferences (P3P 1.0) specification to provide a standard computer-readable format for privacy policies. This ...
متن کاملAgents of Choice: Tools that Facilitate Notice and Choice about Web Site Data Practices
A variety of tools have been introduced recently that are designed to help people protect their privacy on the Internet. These tools perform many different functions including encrypting and/or anonymizing communications, preventing the use of persistent identifiers such as cookies, automatically fetching and analyzing web site privacy policies, and displaying privacyrelated information to user...
متن کاملAutomatic policy enforcement on semantic social data
Web-based data collection of non-reactive data is becoming increasingly important for many social science fields. Being able to introduce and automatically enforce policies that regulate the collection and the use of those data is crucial for taking into account the privacy and confidentiality wishes of data providers. Those policies are currently expressed in natural language or in a language ...
متن کاملThe Usable Privacy Policy Project: Combining Crowdsourcing, Machine Learning and Natural Language Processing to Semi-Automatically Answer Those Privacy Questions Users Care About
Natural language privacy policies have become a de facto standard to address expectations of “notice and choice” on the Web. However, users generally do not read these policies and those who do read them struggle to understand their content. Initiatives aimed at addressing this problem through the development of machine-readable standards have run into obstacles, with many website operators sho...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014